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ABSTRACT. We present a new proof of a theorem of Schur's from 1905 determining the least common multiple 
of the orders of all finite groups of complex n X n-matrices whose elements have traces in the field Q of rational 
numbers. The basic method of proof goes back to Minkowski and proceeds by reduction to the case of finite fields. 
For the most part, we work over an arbitrary number field rather than Q. The first half of the article is expository 
and is intended to be accessible to graduate students and advanced undergraduates. It gives a self-contained 
treatment, following Schur, over the field of rational numbers. 



1. Introduction 

1.1. How large can a finite group of complex n x n-matrices be if n is fixed? Put differently: if Q is 
a finite collection of invertible n x n-matrices over C such that the product of any two matrices in Q again 
belongs to Q, is there a bound on the possible cardinality \Q\, usually called the order of Ql Without further 
restrictions the answer to this question is of course negative. Indeed, the complex numbers contain all roots of 
unity; so there are arbitrarily large finite groups inside C*. Thinking of complex numbers as scalar matrices, 
we also obtain arbitrarily large finite groups of n x n-matrices over C. 

The situation changes when certain arithmetic conditions are imposed on the matrix group Q. When 
all matrices in Q have entries in the field Q rational numbers, Minkowski 1 33 1 has shown that the order of 
Q divides some explicit, and optimal, constant M(n) depending only on the matrix size n. Later, Schur 
1 39 1 improved on this result by showing that Minkowski's bound M(n) still works if only the traces of all 
matrices in Q are required to belong to Q. 

1.2. The first four sections of this article present full proofs of the theorems of Schur and Minkowski 
that depend on very few prerequisites. These sections follow Schur's approach via character theory and have 
been written with a readership of beginning graduate and advanced undergraduate students in mind. Provided 
the reader is willing to accept one simple fact concerning group representations (Fact 2 in Section FOl below). 
the proofs will be completely understandable with only a rudimentary knowledge of linear algebra, group 
theory (symmetric groups, Sylow's theorem), and some algebraic number theory (minimal polynomials, 
Galois groups of cyclotomic fields). The requisite background material will be reviewed in Sectionf3] 
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The material in Section [5] is new. We show that Minkowski's original approach used in |33| in fact 
also yields Schur's theorem |39|. Minkowski's method is conceptually very simple, and it quickly and 
elegantly explains why some bound on the order \Q\ must exist, even for arbitrary algebraic number fields, 
that is, finite extensions of Q. The method proceeds by reduction modulo suitably chosen primes and then 
using information about the orders of certain classical linear groups over finite fields. In fact, the general 
linear group alone almost suffices; only dealing with the 2-part of \Q\ using this strategy requires additional 
information. Since we work over algebraic number fields, a bit more mathematical background is assumed 
in this section. 

As of this writing, Schur's theorem first appeared in print exactly a century ago and Minkowski's goes 
even further back. In the final section of this article, we will survey some recent related work of Collins, Feit 
and Weisfeiler on finite groups of matrices, in particular on the so-called Jordan bound. We will also mention 
two mysterious coincidences concerning the Minkowski numbers M(n), one proven but unexplained, the 
other merely based on experimental evidence as of now. 

1.3. Minkowski |33| proved his remarkable theorem in the course of his investigation of quadratic 
forms. Stated in group theoretical terms, the theorem reads as follows. 

THEOREM 1 (Minkowski 1887). The least common multiple of the orders of all finite groups of n x n- 
matrices over Q is given by 



(1) M(n) = \{ 



Here, |_ . J denotes the greatest integer less than or equal to . and p runs over all primes. Note that if 
p > n + 1 then the corresponding factor in the product equals 1 and can be omitted. Therefore, Q is actually 
a finite product. The first few values of M(n) are: 

M(l) = 2 1 = 2 , M(2) = 2 2+1 3 1 = 24 , M(3) = 2 3+1 3 1 = 48 , M (4) = 2 4+2+1 3 2 5 1 = 5760 . 

1.4. For a positive integer m and a prime p, let m p denote the p-part of m, that is, the largest power 

of p dividing m. Thus, M(n) p — p^- p ~ 1 J + Lp(p-i) J + [ p 2 (p-i> J + "'. This number can be written in a more 
compact form. Indeed, the p-part of ml = 1 • 2 • . . . • m is given by 

(2) (m!) p =pLf J+[^J+- . 

To see this, put ml = y and note that ml = p ■ (2p) ■ . . . ■ (m!p) ■ (factors not divisible by p) . Therefore, 
(m\) p = p m {m'\) p and (0 follows by induction. Using (0 we can write 

(3) M(n) p =pl^\ ' 

1.5. The notation M (n), in the variant M n , was introduced by Schur in [39 1 to honor Minkowski who 
had originally denoted the same number by ri|. Relaxing the condition in Theorem^that all matrix entries 
be rational and replacing it with the weaker requirement that only the matrix traces belong to Q, Schur was 
able to prove that Minkowski's bound M(n) still works: 

Theorem 2 (Schur 1905). If Q is any finite group of n x n-matrices over C such that trace(g) G Q 
holds for all g G Q then the order of Q divides M(n). 

Schur's theorem covers a considerably larger class of groups than Theorem ^ ln |39|, the following 
example of a group covered by Theorem|2]but not Theorem^is given. 
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Example 3. Consider the matrices g = (° and h = (g where j = \/— T S C. Then 
g 2 = h 2 = — 12x2 and <?/i = (9 g) = — /ig. Thus Q = {±12x2, ±<7, ±5/1} is a group of complex 
2 x 2-matrices of order 8; it is isomorphic to the so-called quaternion group Q$. Note that the traces of all 
elements of Q are rational - they are either or ±2 - but Q certainly does not consist of matrices over Q. 
In fact, there does not even exist an invertible complex 2 x 2-matrix a such that the matrices x = aga -1 
and y = aha^ 1 both have entries in the field R of real numbers. To see this, note that x and y both would 
have determinant 1 and trace 0, as g and h do. A direct calculation shows that the product matrix z = xy 
then satisfies Z12 2 + X12 2 + j/12 2 = — X12Z/12 trace(z), where . 12 indicates the (1, 2)-entry of the matrix in 
question. However, trace(z) = trace(gh) = 0. Hence, if x and y are matrices over R then all terms on the 
left will be zero. But then 1 = det(x) = xwx^i = —xu 2 which is impossible. 

We remark in passing that, for any "irreducible" finite group Q of complex n x n-matrices, a necessary 
and sufficient condition for the existence of an invertible complex n x n-matrix a such that a<?a _1 is real for 
all g € Q is that 



The sum on the left is called the Frobenius-Schur indicator of Q; see, e.g., Isaacs 1 19 Chapter 4]. The group 
Q = Qs m the example above has Frobenius-Schur indicator — 1. 

1.6. The proof of Theorems [2 and [2] to be given in Section |4] below proceeds by first exhibiting suf- 
ficiently large groups of rational (in fact, integer) matrices showing that the least common multiple of the 
orders of all finite groups of n x n-matrices over Q must be at least equal to M(n). Thereafter, we may 
concentrate on Theorem|2]which in particular implies that the least common multiple in Theorem[2does not 
exceed M(n). Apart from updating terminology and notation to current usage and adding more generous 
details to the exposition, we have followed Schur's original approach in 1 39 1 quite closely. For a proof of 
Schur's theorem using slightly more sophisticated tools from representation theory, see Isaacs [19 Theorem 
14.19]. Stronger results are presented in Feit 1 16|. 

1.7. This article is dedicated to our friend and colleague Don Passman. Don's contributions to group 
theory and ring theory in general and his expository masterpieces |35|, |36| in particular have profoundly 
influenced our own work. In the course of various collaborations with Don, we have both benefitted from his 
deep insights and his generosity in sharing ideas. 

Notations. Throughout, GL„(i?) will denote the group of all invertible n x n-matrices over the 
commutative ring R. Recall that a matrix over R is invertible if and only if its determinant is an invertible 
element of R. 



The principal goal of this section is to construct certain groups of n x n-matrices over Z such that the 
least common multiple of their orders equals the Minkowski bound M (n) in Q. This will then allow us to 
give a reformulation of the core of Theorem|2] 

2.1. Construction of groups. The main building blocks of the construction will be the symmetric 
groups S r for various r. Recall that S r consists of all permutations of {1, . . . , r} and has order r! . 

PROPOSITION 4. Let a, m and n be positive integers with am < n. Then GL„(Z) has a subgroup Q of 
order \Q\ = {m + l)\ a a\. 
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Proof. If we can realize Q inside GL am (Z) then we can view Q as a subgroup of GL n (Z) via 

/ 



Q C GL Q 



GL am (Z) 



V 



C GL n (Z) . 



1 7 



Therefore, we may assume that n = am. Think of the rows of any n x n-matrix as partitioned into a blocks 
of m adjacent rows, and similarly for the columns. Now consider all matrices in GL„(Z) that have exactly 
one m x m-identity matrix l mX m in each block of rows and each block of columns and Os elsewhere; these 
are special permutation matrices. In fact, the collection of all these matrices forms a subgroup II C GL„(Z) 
that is isomorphic to the symmetric group S a ■ 



(- 



s a ~n = 



V 





1 m X ?ri 
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lm X m 
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Next, we turn to the symmetric group <S m +i. This group acts on the lattice Z m+1 by permuting its canon- 
ical basis e\ = (1, 0, . . . , 0), . . . , e m +i = (0, . . . , 0, 1) via cr(e;) = e a Q\. Note that this action maps the 
following sublattice to itself: 



An = {{Zi,. 



t) G Z" 



$> = 0}s 



(The notation A m comes from the theory of root systems; cf. J3J.) Thus, fixing some Z-basis of A rn , each 
permutation a € S m +i yields a matrix a G GL m (Z). It is easy to see that the map u i— > 5 is an injec- 
tive group homomorphism S m +i — > GL m (Z). Stringing each a-tuple (cti, . . . ,a^) along the diagonal in 
GL„(Z) we obtain a subgroup A C GL„(Z) that is isomorphic to S" n+1 : 



if 



<5m+l — Sm+i X • • • X S m +i = A — < 

a factors 



\ 1 



(To 



> C GL„( 



/J 



The subgroup II of GL n (Z) constructed earlier has only the identity matrix in common with A. Moreover, 
conjugating a matrix in A with a matrix from II simply permutes the ai -blocks along the diagonal. Therefore, 
defining Q to be the subgroup of GL„(Z) that is generated by II and A, we obtain 



\Q\ = |A||n| = (m + l)! a a! 



as desired. 



□ 



Now fix a prime p < n+1. Taking m = p — 1 and a = in Proposition0]we obtain a subgroup Q 

of GL„(Z) of order p! a a!; so \Q\ P — p a (a\) p . In view of 0, this says that \Q\ P = M(n) p , Letting p range 
over all primes < n+1, we have exhibited a collection of subgroups of GL„(Z) such that the least common 
multiple of their orders is M(n) . 
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2.2. Reformulation of Theorem|2| Let Q C GL n (C) be as in Theorem|2] Our goal is to show that, for 

all primes p, the p-part \Q\ p divides M(n) p = p a (al) p with a = ^ryj as in 0. Now Sylow's Theorem 

tells us that Q has subgroups of order \Q\ V , the so-called Sylow p-subgroups of Q. Replacing Q by one of its 
Sylow p-subgroups, the issue becomes to show that \Q \ divides p a al . Therefore, in order to prove Theorem[2] 
and thereby complete the proof of Theorem^ it suffices to establish the following proposition. 

PROPOSITION 5. Let Q be finite subgroup o/GL„(C) whose order is a p-power for some prime p and 
such that trace(<?) € Q holds for all g € Q. Then \Q\ divides p a a\ with a = ^rjj • 

3. Tools for the proof 

The proof of Proposition|5]will depend on three ingredients: a lemma to narrow down the possible trace 
values, some basic facts on characters of group representations, and an observation concerning the familiar 
Vandermonde matrix. We will discuss each of these topics in turn. 

3.1. Traces. This section uses a small amount of algebraic number theory. The book [22 1 by Janusz is 
a good background reference. 

Besides the usual matrix traces, we will use a notion of trace that is associated with field extensions. 
Specifically, let K/F be a finite Galois extension with Galois group T = G&\(K / F). Then the trace 
Tr K/F : K -> F is defined by Tr K/F (a) = £ 7 er l( a ) for a e K. If x m + ex™- 1 + ... is the min- 
imal polynomial of a over F then 

(4) Tr if/F (a) = -H. c . 

m 

This follows from the fact that the minimal polynomial of a is equal to JX"=i ( x ~ a i)> where {ai}™ are the 
distinct Galois conjugates 7(a) with 7 6 T. We will only be concerned with the special case where F = Q 
and K = Q(e 2m / pr ) with p prime. The Galois group of Q(e 27Tl ^ pr )/Q is isomorphic to the group of units 
{Z/p r Z)* of the ring Z/p r Z; its order is ip{p r ) = p r ^(p- 1). 

Lemma 6. Let g S GL„(C) be a matrix whose order is a power of p and such that trace(g) G Q. 
Then trace^) must be one of the values {n, n — p,n — 2p, . . . , n — ap}, where a = j^ryj • Moreover, 
trace^) = n holds only for g = l„ X n- 

Proof. By hypothesis, g p = l„ X n for some r. Let ei, . . . , e„ denote the eigenvalues of g; they are all 
powers of C = e 2 Wp r Hence, trace(g) = ^ Ei belongs to the subring Z[(] C C while also being rational, 
by hypothesis. This implies that trace(g) is actually an integer; see |22 Section 1.2]. Furthermore, by the 
triangle inequality, | trace(g)| < \ £ i\ = n an d < i s equality if and only if all are the same, that is, g is 
a scalar matrix. In particular, trace(g) = n holds only for g — l„ X n- 

Let p = (C — 1) denote the ideal of Z[Q] that is generated by the element Q — 1. So £ = 1 mod p, 
and hence all Si = 1 mod p and trace(<?) = n mod p. Therefore, trace(g) — n e p n Z = (p); see [22 
Theorem 1. 10.1] for the last equality. Since we have already shown that trace(g) < n, we conclude that 
trace(g) = n — pt for some non-negative integer t. It remains to show that t < -^j or, equivalently, 

trace(g) > 



p-1 



To this end, consider the Galois extension Q(£) /Q and its trace Trq^j /q. The minimal polynomial over Q of 
a root of unity of order p s > 1 is given by x pS (p-V + x p s - (p-2) + . . . + 1 

(|22 Theorem 1. 10.1] again). 
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Therefore, equation @ yields 



tp(p r ) if Si = 1 , 

p r ~ 1 if £j has order p , 
otherwise. 



Put rio = #{i | £j = 1} and n-i = #{i | £$ has order p}; so < n 
we obtain 



< n. Using the fact that trace(g) £ Q 
^(j/) trace (5) = Tr Q(c)/q (trace(3)) = ^ Tr Q(c)/q (e 4 ) = ip(p r )n Q - p r ^ 1 n 1 . 

i 

Hence, trace(g) — uq — > — as desired. □ 



3.2. Characters. A complex representation of a group ^ is a homomorphism p: Q — > GL(V) for some 
C-vector space V. If n = dime then we may identify GL(V) with GL„(C); the integer n is called the 
degree of the representation p. The character \ = X P of p is the complex-valued function on Q that is given 

by X(fl0 = trace(>0)) for 3 S 

Fact 1 77ze sum X) g gg JS always an integer that is divisible by \Q\. 
To see this, consider the linear operator e p € Endc(V) = M n (C) that is defined by e p 



Note that p(g)e p = e p holds for all g € G, because multiplication with p(g) simply permutes the summands 
of e p . Hence, e p is an idempotent operator: e 2 p = e p . Therefore, the trace of e p is equal to the rank of e p : 
trace(e p ) = dim c e p (V). On the other hand, trace(e p ) = ^ J2 g eG trace (/°(.9)) = J§\ E 9 es x(flO- Tms 
proves Fact 1. We remark that Fact 1 is a special case of the so-called orthogonality relations of characters. 

Fact 2 The product of any two characters ofQ is again a character ofQ. In particular, all 
powers x s (s > 0) of a character \ ore also characters ofQ. 

Here, the th power x° is the constant function with value 1; it is the character of the so-called trivial repre- 
sentation Q — > C* = GLi(C) sending every g € G to 1. In order to show that the product of two characters, 
X P and x P ', is itself a character, one needs to construct a complex representation of Q whose character is 
Xp • Xp' ■ This is achieved by the so-called tensor product p (g> p' of the representations p and p', a complex 
representation of degree equal to deg p ■ deg p 1 for whose detailed construction the reader is referred to Isaacs 
Il9l Chapter 4] or any other text on group representation theory. More generally, tensor products of repre- 
sentations can be defined for Hopf algebras; they form an important aspect of the current investigation of 
quantum groups. 

3.3. Vandermonde matrix. Given a collection z , . . . , z a of elements in some commutative ring R 
(later we will take R — Z), form the familiar Vandermonde matrix 



A 
1 



v = 



Zl 



a \ 

° \ 



V 1 Z a Z 2 a 



We will exhibit a matrix E over R so that the matrix product V ■ E is diagonal: 

/ \ 

(5) V ■ E — diag z - z s , Z\ - z S) . . . , z a - z s 

) 



0<s<a 
\ s*0 



0<s<a 
sjtl 



0<s<a 
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To this end, let e s = e s (xx, . . . , x a ) denote the s th elementary symmetric function in the commuting variables 
xi, . . . , x a . These functions can be defined by 

a a 

(6) n^-^^E 1 ^- 1 )"" 6 — 

i=l ,s=0 

where x is an additional commuting variable. Explicitly, e s = J2i Yliei Xi > wnere I mns over a U subsets 
/ C {1, . . . , a} with |/| = s. Specializing x to zt> and (xi, . . . , x a ) to (zo, . . . , Zt, . . . , z a ), where zt signals 
that Zt has been deleted from the list, and defining 

E = ((-l) Q - s e Q _ s (z , ...,£ t ,..., *a)), it=0f ... iO 

equation (|6} becomes the desired equation (|5} 

4. Schur's proof of Theorems[l]and|3] 

It remains to prove Proposition|5] So fix a prime p and let Q be finite subgroup of GL n (C) whose order 
\Q\ is a power of p. We assume that trace^) S Q holds for all g G Q. Since the order of each g divides \Q\, 
Lemma|6limplies that the traces trace^) can only take the values 

z t = n-pt with < t < a = ^ . 

Put rrit — #{5 <E G I trace(g) = z t }\ so = 1 by Lemma|6l Proposition [5] is the case t = of the 
following 

Claim. For all < t < a, the order \Q\ divides the product m t p a \ | s — t. 

0<s<a 

To prove this, note that the inclusion Q C GL„(C) is a complex representation of Q with character x(g) = 
trace(g). Therefore, it follows from Facts 1 and 2 above that, for each non-negative integer s, the sum 
S g ea trace((?)' s is an integerthat is divisible by \Q\. In other words, Ylt=o m t z t = m °d | ^ | or, in matrix 
form, 

(7) (m ,...,m a )-V=(0,...,0) mod \G\ , 

where V — {zf) t s=0 is the Vandermonde matrix, as in £13 . 31 Multiplying both sides of equation @ with 
the matrix E constructed in ^3.31 we deduce from equation l|5} that 

m t z t - z s = mod \G\ 

0<s<a 

holds for all < t < a. Since z t — z s = p(s — t), this is exactly what the claim states. This completes the 
proof of Proposition|5] and hence Theorems[2and|2are proved as well. 

5. Minkowski's reduction method 

Minkowski's original proof of Theorem[2is quite different from Schur's. The essential tool are reduc- 
tion homomorphisms to the general linear group over certain finite fields. The reduction method applies 
to algebraic number fields K, that is, finite extensions of Q, and very quickly yields rough bounds for the 
orders of all finite subgroups Q C GL n (K); see Proposition II II below. In fact, subgroups Q C GL„(C) 
satisfying only trace(g) S K for all g G Q can also be treated by this strategy due to the fact that linear 
groups over finite fields can be realized over the subfield generated by the traces; see Lemma [8] A sharp 
bound for the 2'-part of | | can be easily deduced in this way from the well-known order of the general linear 
group over a finite field together with some elementary number theoretic observations; see Proposition 1 151 
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The 2-part of \Q\ requires additional information concerning certain classical groups associated to hermitian 
or skew-hermitian forms. This will be explained in §§ l5.5l and l5.6l below. 

As usual, the field with q elements will be denoted by F q . We will also occasionally write the p-part of 
an integer to as m p = p v p( m \ and m p i will denote the p'-part of to; so m p i — m/m p . 

5.1. The general linear group over finite fields. It is well-known and easy to see that GL„(F g ) has 
order n"=o — 9 % )> c ^-' e -S-' R° tman 1 38 Theorem 8.5]. Thus, if q = p* then 

n 

(8) IGLn^lp^JJ^-l). 

i=l 

LEMMA 7. Let £ be an odd prime. There are infinitely many primes p such that 

|GL„(F p /)|,^ 1+ ^»Ln/rj ([„/,-]!), 

holds for all positive integers n and f, where r — Jjr5rjj' 

Proof. We use the fact that, for odd primes £, the group of units (Z/£ S Z)*~ of the ring Z/£ S Z is cyclic 
of order (p(£ s ) — £ s ~ 1 (£ — 1). Any integer whose residue class modulo £ 2 generates (Z/£ 2 Z)* will also 
generate the units modulo all powers £ s ; see 1 18 proof of Theorem 2 on p. 43]. Moreover, by Dirichlet's 
theorem on primes in arithmetic progression (e.g., |40| p. 61]), the residue class modulo £ 2 of any generator 
of (Z/£ 2 Z)* contains infinitely many primes p. Let p be one of these primes. Then p has order <p(£ s ) in 
(Z/£ S Z)*; so p % = 1 mod I s if and only if i is divisible by f(£ s ). In other words, £ divides p % — 1 if and 
only if £ — 1 divides i and, in this case, 



Now put q = pf. Then £ divides q % — 1 if and only if t divides i and, in this case, (q l — l)g = £ ft {i/r)i. 
For 1 < i < n, this applies to i = r, 2r, . . . , olt, where a = L ri / r J- Thus, | GL n (F p /)|£ = IULi^ 1 — ~ 
(£ fi) a (a\)i , which proves the lemma. □ 

We remark that, for / = 1, the expression £ ( 1 + 1 'p(/))L™/ r J ( [tt,/tJ !)^ in Lemma0is identical with the 
£-part of the Minkowski bound M(n); see equation (|5Jl. Thus, for an odd prime I, 

(9) |GL n (Fp)|^ = M(n)^ 

holds for infinitely many primes p. LemmaQfails for the prime 1 = 2, because the linear group is too big. 
For example, for all odd primes p, \ GL 2 (F p )| 2 = (p — 1)2 {p 2 — 1)2 is divisible by 16 while M (2) 2 = 8. 

LEMMA 8. Let Q be a finite subgroup o/GL„(F g ), where q = p*. Assume that p does not divide \Q\ and 
that p > n. If all g £ Q satisfy trace((?) € F for some subfield F C F 9 then Q is conjugate to a subgroup of 
GL n (F). 

Proof. Let Ik = F alg denote an algebraic closure of F with F q C Ik, and let a denote the canonical 
topological generator of Gal(lk/F) = Z. Then a acts on GL ra (lk) by (ffi,j)^ xn = (9ij) nxn - our 
hypothesis on traces, the map Q — > GL„(lk), g 1— > g a , is a Ik-representations of Q having the same character 
as the inclusion Q GL ra (lk). Since both representations are semisimple, by Maschke's theorem, they are 
isomorphic. (The proof of 1 4 § 12, Proposition 3] works in characteristic p > n.) Thus, there exists a matrix 
u G GL„(lk) such that ugu^ 1 = g a holds for all g S Q. By Lang's theorem |27|, we can write u = v a v~ 1 
for some v € GL„(lk). Thus, each v~ 1 gv is fixed by a, and hence it belongs to GL n (F). By the Noether- 
Deuring Theorem (e.g., Curtis-Reiner |21 P- 139]), we may replace v by a matrix in GL„(F g ), proving the 
lemma. □ 
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REMARKS, (a) Lang's theorem is a much more general result than what is actually needed for the proof 
of Lemma[8] see, e.g., Borel |2 Corollary 16.4]. Indeed, we only invoke the theorem for the algebraic group 
GL n and, in this case, it is a special case of Speiser's version of Hilbert's Theorem 90: the Galois cohomology 
set -ff^F, GL„) is trivial for every field F; cf. Serre |41 Proposition X.3] or Knus et. al. EU Remark 29.3]. 
For a finite field F, triviality of H 1 (F, GL„) amounts to the desired fact that every u € GL n (F alg ) can be 
written as u = ifv^ 1 , where a is the Frobenius generator of G&\(F' d lg /F); see [25 Exercise 2 on p. 442]. 

(b)Itfollowsfrom(a)thati? 1 (F 9 ,PGL„)isti-ivialaswell: every U g PGL n (¥f g ) = GL„(F^ lg )/(D r (3 alg )* 
can be written as U — V a V~ 1 for some V g PGL„(Fg ). Moreover, triviality of iT 1 (F g , PGL n ) is equiv- 
alent to Wedderburn's commutativity theorem for finite division rings; see |41 Proposition X.8] or |25 
p. 396]. For an alternative proof of a version of Lemma[3]based on Wedderburn's commutativity theorem, 
see Isaacs 1 19 Theorem 9. 14]. Incidentally, Wedderburn's article |44| appeared in 1905, as did Schur's, and 
Speiser's generalization of Hilbert's Theorem 90 appeared in 1919 |43 Satz 1]. None of this was available 
to Minkowski when 1 33 1 was written. 

5.2. The reduction map. Throughout this section, K will denote an algebraic number field and Q will 
be a finite subgroup of Q\j n [K\ Furthermore, O = Ok will denote the ring of algebraic integers in K. 

Put L = J2 g eg 9 ' O n c tm ^ * s a ^-stable finitely generated O-submodule of K n . If O is a 
principal ideal domain (or, put differently, K has class number 1) then the theory of modules over PIDs tells 
us that L is isomorphic to O n ; see, e.g., Jacobson |20 Section 3.8]. Therefore: 

If O = Ok is a PID then Q is conjugate in Gh n (K) to a subgroup q/GL„(C). 

For K = Q, for example, this says that every finite subgroup of GL„(Q) can be conjugated into GL„(Z). 
This explains why it was enough to look at integer matrices rather than matrices over Q in Section|2] 

In general, O is a Dedekind domain and the foregoing applies "locally": for every prime ideal p of O, 
the localization O v is a PID; see Jacobson |21 Section 10.2]. Consequently, as above, we may conclude that 
G is conjugate in GL n (K) to a subgroup of GL„(O p ), and hence we may assume that Q C GL„(O p ) after 
replacing Q by a conjugate. In fact, except for finitely many primes of O, the group Q is actually contained 
in GL„(O p ) at the outset: if a £ O is a common denominator for all matrix entries of all elements of 
the original Q then Q C GL n (C[l/a]); so any prime p not containing a will do. Now let p ^ and put 
(p) = p fl Z. Then O/p is a finite field of characteristic p. The number of elements of O/p is often called 
the absolute or counting norm of p; it will be denoted by Af{p). Thus, 

O/p = f m) and Af(p) = pf , 

where / = /(p/Q) is the relative degree of p over Q. Reduction of all matrix entries modulo the maximal 
ideal pO p of O p gives a homomorphism 

(10) Gh n (O p ) -» GL„(Fv(p)) , 

because O p /pO v = O/p. The following lemma is well-known. Only the first assertion will be needed later; 
the second is included for its own sake. Recall that, since O v is a local PID, its non-zero ideals are exactly the 
powers of the maximal ideal pOp. The ramification index of p over Q is the power e such that pO p = p e O P . 

LEMMA 9. The kernel of the reduction homomorphism dlOi has at most p-torsion. In fact, any torsion 
element g in the kernel satisfies g p = l n xnfor some p 1 < ep/(p — 1). 

Proof. For each g e GL„(O p ), define d(g) — sup{m | g — l nxn e M„(p m O p )}; so d(g) = oo 
if and only if g = l nx „ and d(g) > if and only if g belongs to the kernel of (II 0i . Now assume that 
< d = d(g) < oo and write g = l nxn + TT d h, where tt is a generator of the ideal pO p and h G 
M„(O p ) \ M„(pO p ). Then g r = l nxn + n d {rh + s) with a = Ei=2 Q^^h 1 g M„(pO p ). If 
(r,p) = 1 then rh + s ^ M„(pO p ) and so g r ^ l n xn- This shows that the kernel of (II 01 has at most 
p-torsion. 
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We claim that any g € GL„(O p ) with d = d(g) > satisfies d(g p ) > min{e+d, pd}, and d(g p ) = e+d 
if pd > e + d. Indeed, we may assume that d < oo. Writing g = l nxn + 7rd ' 1 be as above, we obtain 
3 P = lnxn + 7r dp h p + t with i = X)i=i \^)^ dt h l . Since p divides all binomial coefficients ( p ) occurring in 
t, we have i e M n (p e+d O p ) \ M n (p e+d+1 O p ). The claim follows from this. We conclude in particular that 
g p ^ lnxn if oo > {p - l)d > e. 

Now assume that g € GL„(0p) is a torsion-element with < d(g) < oo. Then g p = l nxn for 
some positive integer i. If i is chosen minimal then our observations in the previous paragraph imply that 
e > (p — l)d(g p ) > (p — l)p l ~ l d{g). Hence, p 1 < ep/ (p — 1) which proves our second assertion. □ 

The above proof also shows that if m(p — 1) > e then there is no non-trivial torsion in the kernel of the 
homomorphism GL„(O p ) — » GL„(0/p m ) that is defined by reduction of all matrix entries modulo p m O p . 

Example 10. Let K = Q. Then p = (p) and e = 1. Thus, in Lemma[9] we must have i = when 
p is an odd prime, and i < 1 when p = 2. In other words, the kernel of the reduction map GL n (Z( p >) — > 
GL n (F p ) is torsion-free for odd p. For p = 2, the only non-trivial torsion possible is order 2. The kernel of 

GL„(Z( 2 )) -> GL„(Z/4Z) is torsion-free. 

The first assertion of Lemma|9]implies that the p'-part \G\ P * of the order of Q divides | GL n (Fjv'(p))|f)'. 
In view of equation (|8}, this yields the following proposition. 

PROPOSITION 1 1 . Let Q be a finite subgroup of QL n (K), where K is an algebraic number field. Then, 
for each non-zero prime p of Ok tying over p 6 Z, \Q\ P > divides Y[i=i Q^iPT ~ 1) • 

Applying Proposition^Jwith an Y two choices of p lying over different rational primes yields a bound 
for the order of Q. Moreover, Proposition^2 comes close to establishing the Minkowski bound M(n) for the 
field of rational numbers: 

Example 12. For a finite subgroup Q C GL n (Q) and a given prime I, Propositionll llimplies that the 
£-part \Q\i of the order of Q divides | GL n (F p )|^, where p is any prime other than I. Furthermore, if I ^ 2 
then | GL„(F P )|^ = M(n)i for infinitely many primes p, by J9j. Thus, we have shown (again) that if Q is a 
finite subgroup of GL„(Q) then \Q\i divides M(n)i for all primes £ ^ 2. In order to extend this to the prime 
t = 2, Minkowski uses additional facts about quadratic forms. This will be explained below. 

5.3. The Schur bound. Fix an algebraic number field K. We will describe certain constants S(n, K), 
introduced by Schur in |39|, for the purpose of extending Theorem|2]to general algebraic number fields. 
Thus, S(n, Q) will be identical to M (n). Like M (n), the constant S(n, K) will be defined as a product of 
^-factors for all prime numbers £, and almost all ^-factors will be 1. Throughout, we put 

Cm - e 2 ™/™ e C . 

For a given prime £, the chain K n Q(Q) C • • • C K n Q(Ce^) Q K n Q(Q m +i) C ... of subfields of 
K must stabilize, since K is finite over Q. Thus we may define 

(11) m(K,l) = min{m > 1 | KnQ(Q m ) = K n = ...}. 
Now put 

(12) t(K, i) = mQ m <K,v ):Kn Q(6m(K. fl )] . 
and define 

(13) S(n,K) = 2™ _ L'T^2tJ J^^ m( ^^LtTRTiTJ + Ltt(^,*) J + |_f2t(lf,£)J +••• 

t 

i 1 



FINITE MATRIX GROUPS 



II 



Here, £ runs over all rational primes, including 2, and the second equality follows from equation (0. Since 
t(K, £) [K : Q] > i — 1, only finitely many £ will satisfy £(A, £) < n and so almost all ^-factors are trivial. 

Example 13. Let A' = Q(Cfc) for some positive integer k. Since Q(0t) n Q(Ct) = Q(C(fc.t))> we nave 
m(A, £) = max{l, i^(fc)}- If I does not divide k then f(A, £) = I - 1; otherwise t(A, £) = 1. For A = Q 
in particular, we obtain m(Q, £) = 1 and i(Q, £) = £ — 1 for all I, Thus, equation Jl 31 reduces to Q and so 

S(n, Q) = M(n). 

In 1 39 1, Schur proved the following generalization of Theorem[2]using a larger dose of character theory 
than what was needed in Section|4] 

THEOREM 14 (Schur 1905). Let Q be a finite subgroup q/GL„(C) such that the traces of all elements 
of Q belong to some fixed algebraic number field K. Then \Q\ divides S(n, A). 

An alternative description of the constants S(n, A) is as follows. Let pn^ denote the group of all 
£-power complex roots of unity. Then A n Q(/i£»° ) = A fl Q(C^m(s:,«) ). 

• If i is odd then each A n )/Q is a subextension of ) / Q which is cyclic with Galois group 
isomorphic to (Z/£Z)* ^ Z/r* _1 Z x Z/{£ - 1)Z. Also, A n is the fixed subfield of A n Q(O-) 
under the group Z/i^Z. Thus, [AnQ(O-) : Q] = [AnQ(O) : Q}[KnQ(( e ™) : Q]i and [AnQ(O) : Q] 
is a divisor of £ — 1 . Hence, for odd primes £, 

(14) m(K,£) = 1 + t^([A n Q(/X£oc) : Q]) 

*(A,£) = [Q(0) : A n Q(Ce)] = (i-i,[jrnQ(/xioo):Q]) ■ 

• For the prime £ = 2, the extension Q(C 2 ™)/Q has Galois group (Z/2 m Z)* = Z/2 m ~ 2 Z x Z/2Z 
(m > 2). The factor Z/2Z is generated by complex conjugation. When m > 2, the field Q(C2 m ) has 
exactly three subfields that are not contained in Q(C2 m ~ 1 ) : besides Q(C2 m ), there are Q(C2 m + C2™) an( ^ 
(Q(C 2 ™-C 2 ^)- tf*(-K",2) = 1, which certainly holds when m( A, 2) = lorm(A,2) = 2, then AnQ(/Lt 2 <») = 
Q(C 2 ™(if,2)),andso [AnQ(jU 2 ~) : Q] = 2 m ( K > 2 )- 1 . If t(A, 2) 7^ 1 then An Q(jU 2 ~) must be equal to either 
Q(W, 2 > + C^W or Q(C 2m( *,2) - C^yr, a) ). Thus, i(A,2) - 2 and [A n Qfaa-) : Q] = 2 m ^~ 2 . 
In either case, the 2-factor of S(n, A) in ( II 31 simplifies to 

(15) S(n, A) 2 = [A n Q(/x 2 oc ) : Q] Lot J y ( n !) a 
The following properties of S^n, A) are easy to verify: 

(16) S(m, K)S(n, A) divides S(m + n, A) 
and 

(17) S(n, A) divides S(n, F) if A C A. 

5.4. Odd primes. The following proposition establishes Theoremll4lfor the 2'-part of |<7|. The special 
case where A = Q was done earlier in ExamplefL?! 

PROPOSITION 15. Let Q be a finite subgroup o/GL n (C). Assume that the traces of all elements of Q 
belong to some algebraic number field A. Then \G\t divides S(n, K)for all odd primes £. 

Proof. Replacing Q by a conjugate in GL n (C) if necessary, we can make sure that Q C GL„(F) for 
some algebraic number field ADA. Indeed, any splitting field for Q that is finite over A will serve this 
purpose; see |19 Theorem 9.9]. Let O — Of denote the ring of algebraic integers of F and consider any 
non-zero prime p of O such that Q C GL„(0 P ) and £ p. Put (p) =p(lZ and assume that p is chosen as 
in Lemma0and also satisfies p > n. Let p: Q — > GL n (Fjv , (p)) denote the reduction homomorphism i ll Ob 
restricted to Q. Upon replacing £ by a Sylow ^-subgroup, the map p becomes injective, by Lemma[9] and 
our goal now is to show that \Q\ divides S(n, K)e. 
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As in the first paragraph of the proof of Lemma[6] one sees that the traces of all elements of Q actually 
belong to the ring of algebraic integers Ok' of the field K 1 ~ K f] Q(/i^°°). Therefore, trace(p(p)) S ¥ q 
holds for all g 6 Q, where q = J\f(p n Ok 1 ) — P ■ Lemma[8]now implies that p(Q) is conjugate to a 
subgroup of GL n (F q ) and LemmaQfurther gives that 

\Q\ divides |GL n (F,)| £ = *(i+»*(/»L?J (|fiji)^ , 

where r = jtEyj) ■ Now, for odd I, 

S(n,K)t = r [K ^m\ ( 

withm(if,^) = l + ^([XnQ(/x £ oo) : Q]) and t(K, £) = ^ 1 nffWjil^l Q ] ) bv (fl4l. Since the residue class 
of p generates (Z/£ S Z)* for all s, p remains prime in Z see the proof of Lemma0and |18 Theorem 
2 on p. 196]. In particular, p remains prime in Ok 1 , and so / = /(p n Ok'/Q) = [K f] Q(^<*>) : Q]. 
Therefore, | GL n (F (? )|£ = S(n, K)i and the proposition is proved. □ 

5.5. Unitary, orthogonal and symplectic groups. In this section, we review some standard facts about 
hermitian and skew-hermitian forms and certain classical groups that are associated with them. Throughout, 
Ik will denote a field and a i— ► a 6 will be an automorphism of Ik satisfying 9 2 = Id. We assume for simplicity 
that char Ik ^ 2. 

5.5.1. Sesquilinear forms. Let V denote an ?i-dimensional vector space over Ik. A bi-additive map 
f3: V x V — > Ik is called sesquilinear (with respect to 0) if 

/3(av, bw) = ab e (3(v, w) 

holds for all v,w £ V and a, b € Ik. When 9 is the identity, sesquilinear forms are ordinary bilinear 
forms. A sesquilinear form (3 is called non-singular if j3 satisfies the following equivalent conditions: (i) 
(3(v, V) = {0} for v E V implies v = 0; (ii) (3(V, v) = {0} for v € V implies v = 0; (iii) for any basis 
{v\, . . . , v n } of V, the matrix ({3(vi, Vj)) has non-zero determinant; see |29 Proposition XIII.7.2]. If j3 
is any sesquilinear form on V and g 6 GL(y) then, defining /3 ff (w, «') := (3{g{v), g(v')) for w, u' £ V, one 
again obtains a sesquilinear form f3 9 on V with respect to 9; it is called equivalent to /3. 

Sesquilinear forms (3 satisfying (3(w,v) — [3{v,w) e (resp. (3(w,v) = —(3(v,w) e ) for all v,w E V 
are called hermitian (resp. skew-hermitian). The stabilizer in GL(V) of a non-singular hermitian or skew- 
hermitian form (3 is called the group of isometries of (V, /?) and is denoted by Iso(V, so 

Iso(V, P) = {gE GL(V) | /3(ff(«), <?(«')) = «') for all v, v' e V} . 

Let (3 be non-singular skew-hermitian. If j3{v, v) ^ for some uel' then (3' = (3(v, v)j3 is a non-singular 
hermitian form on V with Iso(V, (3') = lso(V, (3). On the other hand, if j3(v, v) — for allvEV then it is 
easy to see that 9 — Id and so (3 is an alternating bilinear form. Therefore, when studying isometry groups 
of non-singular hermitian or skew-hermitian forms (3 on V, it suffices to consider the following cases: 

unitary case: (3 is hermitian with respect to 9 ^ Id; 
orthogonal case: (3 is symmetric bilinear (9 = Id); 
symplectic case: [3 is alternating bilinear (9 = Id). 

5.5.2. Twisting modules. Now assume that V is a finitely generated (left) Ik [Q] -module, where Q is a 
finite group. We let V e = {v e | v € V} denote a copy of V with operations 

u e + w 9 = (v + w) 6 , {avf = a e v e and gv e = (gv) e 

for v, w £ V, a S Ik and g G 5. Then becomes a lk[C?]-module and 

(18) traceye/Jg) = (trace v/k (g)) 
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holds for all g <E Q. Furthermore, there is an isomorphism of Ik [Q] -modules 

(19) (V (8ik V 9 )* = {sesquilinear forms V x V — > Ik with respect to 9} . 

The isomorphism sends a linear for ip: V <E>k V 9 — > Ik to the form ip: V x V ^ k given by <p{v,w) = 
ip(v <8> w ). The group S2 = (r) acts on the space of sesquilinear forms /3: x y — > Ik with respect to # by 

(t0)(u,«;) =/3( V , W ) 9 

for u, w G V. This action commutes with the action of Q. Note however that the action is only Ik-semilinear: 
t(cl(3) = a e r(3. Clearly, j3 is hermitian (resp. skew-hermitian) if and only if t(3 = f3 (resp. t(3 = —f3). 

Lemma 16. Let a: Q —> GL(V) be an irreducible representation of the finite group Q. IfV* = V 9 as 
k[Q]-modules then cr(Q) C Iso(y, 13) for some non-singular form (3 onV that is hermitian or skew-hermitian 
with respect to 9. 

Proof. Since V* = V 9 , we have V* ® k V = (V" ® k V 9 ) * and so 

Endj < (V A ) = {sesquilinear forms V x V — > Ik with respect to 9} 

as lk[^]-modules, by dl 91 . The identity Idy G Endj < (V r ) therefore corresponds to a non-zero Cf-invariant 
sesquilinear form (3. Write (3 = /3+ + /3„ with f3± = \{1 ± where £>2 = (t) as above. Then 

t(3± = ±/3; so (3 + is hermitian and /3_ is skew-hermitian with respect to 9, and at least one of them is 
non-zero. Moreover, both (3± are ^-invariant, since the actions of r and commute. Finally, any non-zero 
Q -invariant hermitian or skew-hermitian form on V is non-singular, because its radical is a proper k[Q]- 
submodule of V, and hence it must be zero because V is assumed simple. □ 

5.5.3. Isometry groups over finite fields. We will now concentrate on the case of a finite field Ik = F q of 
order q = p' for some odd prime p. Let (3 be a non-singular hermitian or skew-hermitian form on V = F™. 
Since we are only interested in the group of isometries Iso( V, (3), we may assume that (3 is unitary, orthogonal 
or symplectic. The orders of these groups are classical; see Dieudonne |13| or Artin Section III.6], for 
example. The original sources are Minkowski's dissertation |32| and Dickson |12|. 

unitary case: Since 9 has order 2 in this case, / must be even. Moreover, (3 is unique up to equiva- 
lence, and so Iso( V, (3) is determined up to conjugation. The order of Iso( V, (3) is 

n 

(20) I Iso(V,/3)| =p/»(«- 1 )/ 4 JJ( p / < /2 _ (_!)«) . 

i=l 

symplectic case: Again, (3 is unique up to equivalence. The dimension n must be even. One has 

n/2 

(21) llso^,/?)!^ 2 / 4 ]^ 21 -!)- 

i=l 

orthogonal case: Here, the order of Iso( V, (3) is given by 

(n-l)/2 

2 (? (™-i) 2 /4 Y[ (<7 24 -l) if n is odd, 

(22, |iso ( i-..;i| = «{ <=1 (n _ 2)/2 

2g n(n-2)/4( g «/2 _ £ ) J| (g 2i - 1) if n is even, 

i=i 

where e = ±1 depends on the form j3. The detailed description of e will not matter for us. 

LEMMA 17. Let K be an algebraic number field contained in Q(/X2°c ) (so K is Galois over (Q and in 
particular stable under complex conjugation). If K (j- R then assume that t(K, 2) = 1. There are infinitely 
many odd primes p of the ring of algebraic integers Ok satisfying the following two conditions: 
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(i) p is stable under complex conjugation, and 

(ii) If [3 is any non-singular hermitian or skew-he rmitian form on V = F™ with respect to the au- 
tomorphism 9 of Ok/P = Fq that is afforded by complex conjugation then | Iso(V,/3)| 2 divides 
S(n,K) 2 . 

PROOF. We will need the following elementary observation. If p is a prime satisfying p = — 1 + 2 k 
mod 2 k+1 for some k > 2 then, for all positive integers i, 

(23) (p* - {-l) l ) 2 = 2 k i 2 . 

To see this, we remark first that (p l — 1)2 = 2 holds for oddz, because the residue class of p modulo 4 is the 
nonidentity element of (Z/4Z)*, and hence the same holds for all odd powers of p. Moreover, since p 2 = 1 
mod 2 k+1 , we have p 1 = 1 mod 2 k+1 for all even i, and hence (jp % + 1)2 = 2. Now, to prove J23L assume 
first that i is odd, say i = 2 j + 1. Then the foregoing implies that p 1 — (— l) 1 = p 2 ip + 1 = p + 1 = 2 k 
mod 2 k+1 , and so (p 1 — (— l) l ) 2 = 2 , proving d23l for odd values of i. Finally, assume that i = 2j. Then 
p 1 - (— I) 1 = (p J — + 1). If j is odd then we know that (p> + 1) 2 = 2 k and (p> — 1) 2 = 2, and hence 
(p 1 — (— l) l ) 2 = 2 k+1 , as desired. When j is even then (pi — 1) 2 = 2 fe j 2 , by induction, and (jp + 1) 2 = 2, 
as we remarked earlier. Thus, ( I23l > is proved in all cases. 

Turning to the proof of the lemma, note that Kj (Q is Galois, being a subextension of the abelian extension 
Q(/x 2 oo ) /Q. Put m = m(if , 2), t = t(K, 2) and C = C2™ ■ Then O becomes 

5(n,X) 2 = [X :Q]L"/tJ 2 «(n!) 2 

and K is one of the fields Q(£) or Q(£ + C see EH We will deal with each of these cases separately. 
Throughout, p will denote a prime ideal of Ok and we put q = 7V(p) and (p) = p n Z. 

First consider the case where K is real. Then property (i) is automatic and Iso(V, (3) is symplectic or 
orthogonal. Replacing the factor (q n / 2 —e) in formula J22b for even n by its multiple (g n / 2 — e)(q n ^ 2 +e)/2 = 

(q n — l)/2 and deleting g-factors (which are odd) we obtain the expression n"=i (l 2t ~ 1) mat on ly depends 
on n and q and is identical to (12 1 1 stripped of its g-factors. Put 

(n-l)/2 

2 ]Q (g 2l -l) if n is odd, 



o(n, q) = < 



n/2 



Y\_(q 21 - 1) if n is even. 



Now g = pf, where / = [Oa'/P : ">] is a divisor of [K : Q]; so / is a power of 2. Choose p to lie over any 
rational prime p with p = 3 mod 8. Then (I23> with k = 2 implies that the 2-part of q 2t — 1 for i > 1 is 
given by (q 2i — l) = 8/12- It follows that the 2-part of o(n, q) can be written as o(n, q) 2 = /L"/ 2 J 2™(n!) 



2 



in both cases. Since / is a divisor of [K : Q] and t equals 1 or 2, we see that 0(71,5)2 divides S(n,K) 2 
which settles the symplectic and orthogonal cases. 

Next, let K — with m > 2. Choose p to lie over any rational prime p satisfying p = — 1 + 2 m 
mod 2 m+1 . The p is stable under complex conjugation. Indeed, the decomposition group of p is generated 
by the automorphism of K sending £ to Cf (cf., e.g., 1181 Corollary on p. 197]), and our choice of p implies 
that <^ p = (p 1 = Thus, complex conjugation belongs to the decomposition group of p, and it must in 
fact generated the decomposition group, because is not a square in Gal(i^/Q). Since p is unramified over 
Q, its relative degree over Q equals / = 2; so q = p 2 . Therefore, i20\ and i23\ give 



|Iso(F,/?)| 2 = Y[(p l - (-1)% = 2 mn (nl) 2 = 2(™- 1 )"2"(n!) 2 . 

Since [K : Q] = 2' Tl ~ 1 and t = 1, the last expression is equal to S(n, K) 2 , thereby completing the proof of 
the lemma. □ 
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The lemma fails in the excluded case K R, t(K,2) = 2. For example, let if = Q(-\/— 2). Then 

m(if, 2) = 3 and t(K, 2) = 2 and so S(n, if ) 2 = 2^ J 2 n (n!) 2 . On the other hand, if p is an odd prime 
of Ok that is stable under complex conjugation, then /(p/Q) = 2 and p = — 1 mod 8. It follows that 
| Iso(V, /3)| 2 = - (-!) l )2 is divisible by 2 3 ™ which is too big. 

5.6. The prime 1 = 2. The following proposition complements Proposition [21 It would be nice to 
remove the restrictions if' = if fl Q(/^ 2 °°) C R or t(K, 2) = 1 on if. This would require replacing the 
isometry groups Iso(V, (3) by suitable subgroups. 

PROPOSITION 18. Let Q be a finite subgroup o/GL n (C) such that the traces of all elements ofQ belong 
to some fixed algebraic number field if. Assume that if' — if n Q(/x 2 =° ) Q R or f (if, 2) = 1. Then |0| 2 
divides S(n, if). 

Proof. We may assume that Q is a 2-group. Therefore, trace(g) G Ok 1 for all g e C/. Replacing if 
by if', we may assume that if = if' C Q(//2«>); see dl7> . Choose a prime p of Ox as in LemmalT7land put 
q = Af(p). As in the proof of Proposition [21 we can arrange that Q C GL n (F) for some algebraic number 
field F containing if. Choose a prime of O — Of lying over p and put Ik = O/ty; so ¥ q = Ok/P C Ik. 
We may assume Q C GL„(Oqj) and that (p) = Cp n Z satisfies p > n. By Lemma |9j the reduction 
homomorphism GL„(0fp) — * GL„(lk) is injective on Q. We will write the restriction of this map to Q as 

p: Q — GL„(B<) . 

Then trace p(g) = trace g mod p E F q C Ik for g e C/, and trace pig' 1 ) = (trace p(g)) 6 , where 9 denotes 
the automorphism of ¥ q that is afforded by complex conjugation, as in Lemma flTl Now Lemma[8]implies 
that p(G) v = v~ 1 p(Q)v C GL„(F 9 ) for some v € GL„(lk); so we may consider the representation 

<r = (.) v op:g^GL(V), 

where V = ¥ q . Note that trace a(g) = trace p(g) for all g E Q. We will write 1/ as a direct sum of 
F 9 [t/]-submodules Ui on which Q acts as a subgroup of Iso(C/j, /3j) for some non-singular hermitian or skew- 
hermitian form /3j with respect to # on fT*. This will imply that \Q\ divides J\ i | lso(Ui, ft)| 2 , and hence 
|£| divides T] t ^dimf/*, AT) by Lemma[T7] Since f], 5(dimt/i, if) is a divisor of S(jr. dim[/ 4 , if) = 
S'(n, if), by ( I16K the theorem will follow. 

To achieve the decomposition of V, recall that trace o(g~ l ) = (trace a \g)f for all g G Q. By dl8> . 
this says that the F g [0] -modules and V s have the same character, and hence they are isomorphic; see 
the proof of Lemma |8] Write V = 0. V t {n ° with non-isomorphic irreducible F q [G] -modules V,. Then 

V* = 4 (^*) (n<) and V s t (Vf) (n<) . For each i, there is an i> so that V^* £* Vj?. If i = i' then 
Lemma H6l says that Q acts on as a subgroup of Iso(Vi, /3j) for some non-singular hermitian or skew- 
hermitian form on Vi. Now assume that i ^ i' . Then V* © Vf is a direct summand of V e , and hence 
= \Yi) © ^ is a direct summand of V. Defining 

Pi(f e + vJ' e + v')=f(v') e + f(v) 

for /, /' S V^* and u, d' S 14 we obtain a non-singular hermitian form on Vi that is preserved by the action 
of Q. This yields the desired decomposition of V and completes the proof of the theorem. □ 
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6. Outlook 

We conclude by surveying, without proofs, a number of topics that are related to the foregoing. 
6.1. The largest groups and recent work on the Jordan bound. 

6.1.1. The group Q constructed in Proposition's isomorphic to the so-called wreath product 

Sm+l I S a . 

By definition, S m +i I S a is the semidirect product of S^ +1 xi S a , where S a acts on S% 1+1 — S m +i x • ■ ■ x 
Sm+i by permuting the a factors S m +i. The special case m = 1 yields the group {±1} I S n , a subgroup 
of GL„(Z) order 2™n! which is also known as the automorphism group Aut(i?„) of the root system of type 
B n ; see |3|. For almost all values of n, these particular groups turn out to be the largest finite groups that 
can be found inside GL„(Z), and even inside GL„(Q) (see £15.21 . Indeed, Feit |15| has shown that, for all 
n > 10 and for n = 1,3, 5, the finite subgroups of GL„(Q) of largest order are precisely the conjugates 
of Aut(_B„). For the remaining values of n, Feit also characterizes the largest finite subgroups of GL„(Q) 
and shows that they are unique up to conjugacy. Feit's proof depends in an essential way on an unfinished 
manuscript of Weisfeiler [45 1 which establishes an estimate for the so-called Jordan bound; see § 16.1.21 
below. An alternative proof of Feit's theorem for sufficiently large values of n has been given by Friedland 
1 17 1 who relies on another (published) article of Weisfeiler's, |46|. Both |45| and |46| depend crucially on 
the classification of finite simple groups. 

Sadly, the two protagonists of the developments sketched above are no longer with us: Walter Feit passed 
away on July 29, 2004 while Boris Weisfeiler disappeared in January 1985 during a hiking trip in the Chilean 
Andes. The present status of the investigation into Weisfeiler's disappearance is documented on the web site 
http://www.weisfeiler.com/boris/. For further information on the subject of finite subgroups of GL„(Z) and 
of GL„(Q), especially maximal ones, see, e.g., Nebe and Plesken |34|, Plesken |37|, the first chapter of 1 31 1 
and, at a more elementary level, the article |26| by Kuzmanovich and Pavlichenkov. 

6.1.2. The Jordan bound comes from the following classical result |23|. 

THEOREM 19 (Jordan 1878). There exists a function j: N — > N such that every finite subgroup of 
GL„(C) contains an abelian normal subgroup of index at most j{n). 

Early estimates for the optimal function j(n) were quite astronomical. Until fairly recently, the best 
known result was due to Blichfeldt: j(n) < nl Q( n - 1 )( n ( n + 1 )+ 1 ) t where Tr(n + 1) denotes the number of 
primes < n + 1; see fiM Theorem 30.4]. Since <S„+i C GL„(C), as explained in the proof of Proposition^ 
one must certainly have j(n) > (n + 1)! for n > 4 . In his near-complete manuscript |45|, Weisfeiler comes 
close to proving that equality holds for large enough n: he shows that if n > 63 then j(n) < (n + 2)1. In 
|46|, Weisfeiler announces the weaker upper bound j(n) < n alogn+b nl. Quite recently, Michael Collins 
1 10 1 was able to settle the problem by showing that for n > 71 we do indeed have j(n) = (n + 1)1 and, if 
this bound is achieved by Q, then Q modulo its center is isomorphic to S n +\. 

6.1.3. Analogs of Jordan's Theorem for linear groups in characteristics p > were established by 
Weisfeiler |45|, |46|, Larsen and Pink |30|, and Collins |9|. While both Weisfeiler and Collins rely on the 
classification theorem, Larsen and Pink prove a noneffective version of Jordan's theorem, without explicit 
index and degree bounds, by using methods from algebraic geometry and the theory of linear algebraic 
groups instead. We will explain Collins' modular version of Jordan's Theorem. As usual, O p (Q) denotes the 
maximal normal p-subgroup of the finite group Q. Furthermore, a group is called quasisimple if it is perfect 
and simple modulo its center. Collins' result then reads as follows. 

THEOREM 20 (Collins 2005). Let F be afield of positive characteristic p and let Q be a finite subgroup 
of GL n {F), where n > 71. Put G = Q /O p {Q). Then G has a normal subgroup N such that 

(a) N = AQi . . . Q m , a central product with A abelian and the Q t (quasi)simple Chevalley groups in 
characteristic p. 
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(b) [G:N}< 



(n + 2)! divides n - 
(n + 1)! otherwise. 



6.2. The Minkowski sequence M(n). A search of Sloane's On-Line Encyclopedia of Integer Se- 
quences |42|, by entering the first six terms 2, 24, 48, 5760, 11520, 2903040 of M(n), turns up a sequence 
labeled A053657. This sequence has two additional descriptions besides Minkowski's description of M (n) 
as the least common multiple of the orders of all finite subgroups of GL„(Q); the other two will be given 
below. We know of no direct argument explaining the (proven) equivalence of M(n) to the first sequence 
below. The equivalence of the second sequence to M{n) is currently supported only by empirical evidence. 

• By Chabert et. al. [7|, the collection of all leading coefficients of polynomials f(x) € Q[x] of 
degree at most n such that f(j>) S Z holds for all primes p is a fractional ideal of the form jjjjyZ 
for suitable positive integers a(n). It turns out that formula Q is identical with the formula given 
in Q Proposition 4.1] for a(n + 1). Thus M(n — 1) = a(n). 

• Following Paul Hanna |42 A075264], we let P(n,z) denote the coefficient of x n in the Taylor 
series for (^Miz5l)« at x = 0. Thus, £~ =1 ( = £^ P(n, z)x n with £ = - 
1 = ST=i F^T and (*) = '('-D-^+D , For example, P(l, z) = f, P(2,z) = 

P(3, z) = 62:+5 / 8 +z ■ In general, P(n, z) £ zQ[z]; the polynomials P(n, z) for n < 8 are listed 
in sequence A075264 of OEIS |42|. Paul Hanna has noted that the denominator of P(n, z), that is, 
the positive generator of the ideal {q E Z | qP(n, z) 6 Z[z]}, appears to coincide with M(n). 
In 1 33 1, Minkowski states the following recursion for the sequence M(n); the recursion is easy to check 
from Q: 

(24) M(2n + 1) = 2 M(2n) and M(2n) = 2 M(2n - 1) J| pn p . 

p : j>— 1 |2n 

The product in (1241 ranges over all primes p such that p — 1 divides 2n, and n p denotes the p-part of n, as 
usual. This product has an interpretation in terms of the familiar Bernoulli numbers B n which are defined by 
= J2^Lo ^"fj- ^ n ^ act ' B n — for odd n > 1 while B>2 n is a rational number whose denominator, 
when written in lowest terms, is given by the von Staudt-Clausen theorem: it is equal to J\ p . p _ 1 1 2n p ; cf . J5| 
Theorem 1 ] . Moreover, for each prime p such that p — 1 does not divide 2n, the numerator of B^n is divisible 
by the p-part n p ; see Theorem 5]. Consequently, the product p-i^n?" 1 ? m ^4> is equal to the 
denominator of . This was already pointed out by Minkowski in |33|. Finally, the asymptotic order of 
M n has been determined by Katznelson |24|: lim,^^ (M(n)/n\) 1/n = l\ P P 1/(p ~ 1)2 ~ 3.4109. 

Acknowledgment. The authors wish to thank Boris Datskovsky and Ed Letzter for their comments 
on a preliminary version of this article, and Michael Collins for making drafts of |8|,|9|,|10| available to 
them. 
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